PATENT

ATTORNEY DOCKET NO: 06632/01 1001 



PRODUCTION OF BIOFILAMENTS IN TRANSGENIC ANIMALS

Background of the Invention

The utility of spider silk proteins as "super filaments" has led to attempts to 
produce recombinant spider dragline silks (Prince et al. Biochemistry 34: 10879-10885, 
1995; Fahnestock and Irwin, Appl. Microbiol. Biotechnol. 47: 23-32, 1997; Fahnestock 
and Bedzyk, Appl. Microbiol. Biotechnol. 47: 33-39, 1997). Although successful
expression of spider silk has been demonstrated in E. coli, significant problems remain,
such as instability of the sequences used due to recombination and rearrangement in the
repetitive areas of the gene, inefficient transcription, translational pausing and premature
termination of synthesis resulting in limitations on the length of the silk that can be
produced efficiently (less than or equal to 1000 amino acids), low protein yields (4 to 300
mg/liter), and low solubility of the produced fiber.

Production of synthetic spider dragline silk protein in the methylotrophic yeast
Pichia pastoris was superior to that of E. coli in that longer proteins of at least 1600
amino acids in length could be produced efficiently, with no observed size heterogeneity
due to premature termination of synthesis and with stable repetitive sequences in the
genes being expressed. Yields of protein in the Pichia system were 663 mg/liter;
however, only 15% of this was in a soluble form in the yeast cell lysate.



Summary of the Invention 
We have discovered a method for producing biofilaments (e.g., spider silk protein) 
in transgenic animals. This method greatly facilitates the production of these proteins, 
which are useful as "super filaments."



In a first aspect, the invention features a nucleic acid molecule that includes (i) a 
nucleic acid sequence encoding a biofilament, (ii) a promoter that directs expression of a 
polypeptide in milk-producing cells or urine-producing cells, where the promoter is 
operably linked to the nucleic acid sequence, and (iii) a leader sequence that enables 
secretion of the biofilament by the milk-producing cells or the urine-producing cells, into
the milk or urine of a mammal, respectively. 

In a second aspect, the invention features a mammalian embryo whose nucleus 
contains a nucleic acid molecule that includes (i) a nucleic acid sequence encoding a 
biofilament, (ii) a promoter that directs expression of a polypeptide in milk-producing
cells or urine-producing cells, where the promoter is operably linked to the nucleic acid
sequence, and (iii) a leader sequence that enables secretion of the biofilament by the milk- 
producing cells or the urine-producing cells, into, respectively, milk or urine of the 
mammal that develops from the nucleic acid molecule-containing embryo. Preferably, 
the nucleic acid molecule is inserted by artifice. 

In a third aspect, the invention features a female mammal in which the genome of
the mammary tissue of the female mammal contains a nucleic acid molecule that includes
(i) a nucleic acid sequence encoding a biofilament, (ii) a promoter that directs expression
of a polypeptide in milk-producing cells, where the promoter is operably linked to the
nucleic acid sequence, and (iii) a leader sequence that enables secretion of the biofilament
by the milk-producing cells, into milk of the female mammal. Preferably, the female
mammal is a rodent, a ruminant, or a goat.

In a fourth aspect, the invention features an animal in which the genome of cells
that contribute to urine production in the animal contains a nucleic acid molecule that
includes (i) a nucleic acid sequence encoding a biofilament, (ii) a promoter that directs
expression of a polypeptide in urine-producing cells, where the promoter is operably linked
to the nucleic acid sequence, and (iii) a leader sequence that enables secretion of the
biofilament by the urine-producing cells, into urine of the animal. Preferably, the animal
is a mammal.

In one embodiment of all of the above aspects of the invention, the biofilament is
spider silk (e.g., dragline silk). In another embodiment of all of the above aspects, the
nucleic acid molecule includes an intron. In a preferred embodiment, when secreted such
that the secretion is subjected to shear forces and mechanical extension, the biofilament
has a poly-alanine segment that undergoes a helix to β-sheet transition, where the
transition forms a β-crystal that stabilizes the structure of the biofilament.

In another preferred embodiment of all of the above aspects of the invention, the
biofilament has an amorphous domain that forms a β-pleated sheet such that inter-β-sheet
spacings are between 3 angstroms and 8 angstroms in size. In yet another preferred
embodiment, the biofilament has a C-terminal amino acid motif containing an amorphous
domain and a crystal forming domain, where the motif has a sequence that is at least 50%
identical to SEQ ID NO: 2. In yet another preferred embodiment, the biofilament has a
consensus sequence that is at least 50% identical to SEQ ID NO: 3. In

In a fifth aspect, the invention features a method for producing a biofilament that 
includes the steps of: (a) providing an embryonal cell transfected with a biofilament- 
encoding nucleic acid molecule that expresses and causes secretion of the biofilament
from a cell derived from the transfected embryonal cell; (b) growing the embryonal cell to
produce an animal containing biofilament expressing and secreting cells; and (c) isolating 
the biofilament from the biofilament expressing and secreting cells from the animal. 

In a sixth aspect, the invention features a method for producing a biofilament that
includes the steps of: (a) providing an animal cell transfected with a nucleic acid
molecule that contains (i) a nucleic acid sequence encoding a biofilament, (ii) a promoter
that directs expression of a polypeptide in an animal cell, and (iii) a leader sequence that
causes secretion of the biofilament by the cell; (b) culturing the transfected cell; and (c)
isolating the biofilament from the culture medium of the cultured transfected cell. In
other embodiments, the biofilament has a sequence that is 50% identical to SEQ ID NO:
9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14,
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or
SEQ ID NO: 20.

In one embodiment of the fifth and sixth aspects of the invention, the biofilament
is spider silk (e.g., dragline silk). In another embodiment, the nucleic acid molecule
contains an intron. In yet another embodiment, the animal is a mammal. In a preferred
embodiment, when secreted such that the secretion is subjected to shear forces and
mechanical extension, the biofilament has a poly-alanine segment that undergoes a helix
to β-sheet transition, where the transition forms a β-crystal that stabilizes the structure
of the biofilament.

In another preferred embodiment of the fifth and sixth aspects of the invention, the
biofilament has an amorphous domain that forms a β-pleated sheet such that inter-β-sheet
spacings are between 3 angstroms and 8 angstroms in size. In another preferred
embodiment, the biofilament has a C-terminal amino acid motif containing an amorphous
domain and a crystal forming domain, the motif having a sequence that is at least 50%
identical to SEQ ID NO: 2. In yet another preferred embodiment, the biofilament has a
consensus sequence that is at least 50% identical to SEQ ID NO: 3. In other
embodiments, the biofilament has a sequence that is 50% identical to SEQ ID NO: 9,
SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14,
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, or
SEQ ID NO: 20.

In a preferred embodiment of all of the above aspects of the invention, where the
biofilament is dragline silk, preferably the cell (or animal (e.g., a mammal) developed
from the cell) produces both of the polypeptides (e.g., ADF-3 and ADF-4) required to
produce the dragline silk at a ratio that is at least 40% of the ratio of these polypeptides
produced in the natural source (e.g., in the spider).

By "biofilament" is meant a fibrous protein that is normally produced and secreted
by any one of a variety of insects and arachnids. Biofilaments are composed of
alternating crystalline and amorphous regions. Exemplary biofilaments include spider
silk, an externally spun fibrous protein secretion found in a variety of arachnids (e.g.,
Nephila clavipes), and fibroin, an externally spun fibrous protein secretion found in a
variety of insects (e.g., Bombyx mori). Preferable biofilaments, when secreted such that
the secretion is subjected to shear forces and mechanical extension, have a poly-alanine
segment, forming a crystal-forming domain, that undergoes a helix to β-sheet transition,
forming a β-crystal that stabilizes its structure. Preferably, the amorphous domain of a
biofilament forms a β-pleated sheet such that inter-β-sheet spacings are between 3
angstroms and 8 angstroms in size, preferably, between 3.5 angstroms and 7.5 angstroms
in size.

Preferably, a biofilament has a C-terminal portion with an amino acid repeat motif
which is between 20 and 40 amino acids in length, more preferably, 34 amino acids in length,
and a consensus sequence which is between 35 and 55 amino acids in length, more
preferably 47 amino acids in length. Preferably, a biofilament has an amino acid repeat
motif (creating both the amorphous domain and the crystal-forming domain) having a
sequence that is at least 50% identical to the sequence of SEQ ID NO: 2, more preferably,
at least 70% identical, and most preferably, at least 90% identical to SEQ ID NO: 2.
Preferably, a biofilament has a consensus sequence that is at least 50% identical to the
sequence of SEQ ID NO: 3, more preferably, at least 70% identical, and most preferably,
at least 90% identical to SEQ ID NO: 3.
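
The 50%, 70%, and 90% identity thresholds can be illustrated with a short calculation. The following Python sketch is illustrative only: this description does not specify how percent identity is computed, so the sketch assumes the simplest case, an ungapped, position-by-position comparison of two sequences of equal length. The function names are hypothetical, and the reference sequence is the 34 amino acid repeat motif given in this description as SEQ ID NO: 2.

```python
# Illustrative sketch only. Assumption: identity is scored position by position
# over two equal-length, ungapped sequences; gapped alignment is not handled.

SEQ_ID_NO_2 = "AGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAAGG"  # 34-residue repeat motif


def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("this sketch compares equal-length, ungapped sequences only")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def identity_tier(candidate: str, reference: str = SEQ_ID_NO_2) -> str:
    """Map a candidate motif onto the 50% / 70% / 90% preference tiers."""
    pct = percent_identity(candidate, reference)
    for threshold, label in ((90, "most preferred"),
                             (70, "more preferred"),
                             (50, "preferred")):
        if pct >= threshold:
            return label
    return "below 50%"


if __name__ == "__main__":
    # Hypothetical variant: the first four residues are substituted (30 of 34 match).
    variant = "TTTT" + SEQ_ID_NO_2[4:]
    print(round(percent_identity(variant, SEQ_ID_NO_2), 1), identity_tier(variant))
```

A variant with four substitutions in 34 residues is about 88% identical and therefore falls in the "more preferably, at least 70%" tier rather than the 90% tier.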

By a "promoter" is meant a nucleic acid sequence sufficient to direct transcription.
Also included in the invention are those promoter elements which are sufficient to render
promoter-dependent gene expression controllable for cell type-specific or tissue-specific
expression, or inducible by external signals or agents; such elements may be located in the
5' or 3' regions of the native gene. Preferred promoters of the invention direct transcription of a
protein in a milk-producing cell; such promoters include, without limitation, promoters
from the following genes: whey acidic protein, αS1-casein, αS2-casein, β-casein,
κ-casein, β-lactoglobulin, and α-lactalbumin. Other preferred promoters of the invention
direct transcription of a protein in a urine-producing cell (e.g., a uroepithelial cell or a
kidney cell); such promoters include, without limitation, the promoter from the uroplakin
gene. Yet another preferred promoter of the invention directs transcription of a protein in
an embryonal cell.

By a "leader sequence" or a "signal sequence" is meant a nucleic acid sequence 
which, when operably linked to a nucleic acid molecule, allows for the secretion of the 
product of the nucleic acid molecule. The leader sequence is preferably located 5' to the 
nucleic acid molecule. Preferably, the leader sequence is obtained from the same gene as the
promoter that is used to direct the transcription of the nucleic acid molecule, or is
obtained from the gene from which the nucleic acid molecule is derived. 

By "culture medium" is meant the medium surrounding a cell, and in which the 
cell has been living. If the cell is secreting a protein (e.g., a biofilament), the culture 
medium of the cell will contain the protein secreted by that cell. 

By a "transfected cell" or a "transformed cell" is meant a cell into which (or into
an ancestor of which) has been introduced, by means of recombinant DNA techniques, a
nucleic acid molecule encoding a polypeptide of the invention. Preferably, the cell is a
eukaryotic cell from a multicellular animal (e.g., a mammal).

By an "embryonal cell" is meant a cell that is capable of being a progenitor to all
the somatic and germ-line cells of an organism. Exemplary embryonal cells are
embryonic stem cells (ES cells) and fertilized oocytes. Preferably, the embryonal cells of
the invention are mammalian embryonal cells.

By "endogenous," as used herein in reference to a gene or a polypeptide, is meant
a gene or polypeptide that is normally present in an animal.

By "germ-line cell" is meant a eukaryotic cell, progenitor, or progeny thereof,
which is a product of a meiotic cell division.

By "operably linked" is meant that a nucleic acid molecule and one or more
regulatory sequences (e.g., a promoter) are connected in such a way as to permit
expression and/or secretion of the product (i.e., a polypeptide) of the nucleic acid
molecule when the appropriate molecules (e.g., transcriptional activator proteins) are
bound to the regulatory sequences.

By "transgene" is meant any piece of nucleic acid that is inserted by artifice into a
cell, or an ancestor thereof, and becomes part of the genome of the animal which
develops from that cell. Such a transgene may include a gene which is partly or entirely
heterologous (i.e., foreign) to the transgenic animal, or may represent a gene homologous
to an endogenous gene of the animal.

By "transgenic" is meant any cell which includes a nucleic acid sequence that has
been inserted by artifice into a cell, or an ancestor thereof, and becomes part of the
genome of the animal which develops from that cell. Preferably, the transgenic animals
are transgenic mammals (e.g., rodents or ruminants). Preferably the nucleic acid
(transgene) is inserted by artifice into the nuclear genome.



Brief Description of the Drawing 
Fig. 1A is a schematic representation of the goat β-casein/NcDS-1 construct
containing the β-casein promoter and signal sequence, the 1.5 kb NcDS-1 cDNA, and the
3' UTR from β-casein.

Fig. 1B is a schematic representation of the Whey Acidic Protein (WAP)/NcDS-1
construct containing the WAP gene promoter, the WAP signal sequence, a 1.5 kb cDNA 
encoding dragline silk (NcDS-1), and the 3' end of the WAP gene. 

Fig. 1C is a schematic representation of the uroplakin II/NcDS-1 construct
containing the promoter and signal sequence from the uroplakin II gene, the NcDS-1
cDNA insert, and the 3' UTR region from the mouse protamine-1 (Mp-1) gene.



Detailed Description 
Biofilaments, such as spider silk, have a number of high performance mechanical 
properties that make them comparable to the "super-filaments" Spectra™ (commercially
available from AlliedSignal) or Kevlar™ (commercially available from DuPont). Of
particular importance is the energy to break a biofilament, which means that biofilaments 
can absorb energy when stretched, and dissipate that energy as heat when the stress is 
removed. Furthermore, biofilaments are resistant to digestion by proteolytic enzymes, 
and are insoluble in dilute acids and bases. 

Biofilaments are a natural source of material that is renewable. Transgenic
animals expressing biofilaments, such as the animals described herein, may be managed
as normal livestock using available feedstuffs. Thus, the manufacture of this commodity
is domestic and renewable. The truly natural source, the spiders themselves, is,
unfortunately, too small to produce significant amounts of this material. In addition,
since spiders manifest territorial behavior, they cannot be raised in close quarters for mass
production.

The weight of the biofilament is important for ballistic protection applications. 
Modern ballistic protection demands a flexible and light material that
retains its unique mechanical properties. Biofilaments are lighter than Kevlar™, and are 
more flexible than Spectra™. 

I. Spider silk-encoding genes

The size of the native dragline silk protein has not been determined conclusively, 
but has been observed to range from 274 kDa to 750 kDa. The N-terminal amino acid
sequence and the 5'-end nucleic acid sequence have not been elucidated. Partial cDNA
clones (spidroin 1 and 2) from Nephila clavipes (the golden orb weaver found in Brazil
and southern Florida) and from Araneus diadematus (Guerette et al., Science 272:
112-115, 1996) have been isolated and found to be part of the 3'-end of their
corresponding genes. A summary of the clones isolated from Araneus, and the glands 
from which they are derived, is shown below in Table 1.

Table 1

Araneus diadematus genes (libraries screened using spidroin-1, 2 from N. clavipes)
Distribution-Expression of specific silk type (as determined by Northern Blot)

Gene     major ampullae     minor ampullae     flagelliform     aggregate        cylindrical
Cloned   gland (producing   gland (producing   gland            gland            gland
         dragline silk)     accessory fiber)   (producing       (producing       (producing
                                               viscid silk)     viscid silk)     cocoon silk)

ADF-1    -                  major              -                -                -
ADF-2    -                  -                  -                -                minor
ADF-3    major              -                  minor            minor            -
ADF-4    major              -                  -                -                -

The sequence of ADF-1 (Araneus diadematus fibroin) is 68% poly(A)5 (i.e.,
AAAAA; SEQ ID NO: 9) or (GA)2-7 (SEQ ID NO: 10), and 32% GGYGQGY (SEQ ID
NO: 11); ADF-2 is 19% poly(A)8 (SEQ ID NO: 12) and 81% [GGAGQGGY (SEQ ID
NO: 13) and GGQGGQGGYGGLGSQGA (SEQ ID NO: 14)]; ADF-3 is 21%
ASAAAAAA (SEQ ID NO: 15) and 79% [(GPGQQ)n, where n = 1-8 (SEQ ID NO: 16)
and GPGGQGPYGPG (SEQ ID NO: 17)]; and ADF-4 is 27% SSAAAAAAAA (SEQ ID
NO: 18) and 73% [GPGSQGPS (SEQ ID NO: 19) and GPGGY (SEQ ID NO: 20)].

Any of a variety of procedures well known in the art may be utilized to clone 
additional biofilament-encoding genes using the known nucleic acid sequences from 
Nephila and Araneus, and one so skilled will routinely adapt one of these methods in
order to obtain the desired gene. The full length clones of Nephila and Araneus genes
may be likewise isolated. 

One such method for obtaining a biofilament-encoding gene sequence is to use an
oligonucleotide probe generated from the Nephila clavipes spidroin 1 gene sequence
(Arcidiacono et al., Appl. Microbiol. Biotechnol. 49: 31-38, 1998) to screen an arachnid
or insect cDNA or genomic DNA library for sequences which hybridize to the probe.
Hybridization techniques are well known to the skilled artisan, and are described, for
example, in Ausubel et al., supra, and Sambrook et al., supra. cDNA or genomic DNA
library preparation is also well known in the art. A large number of prepared nucleic acid
libraries from a variety of species are also commercially available. The oligonucleotide
probes are readily designed using the sequences described herein and standard techniques.
The oligonucleotide probes may be based upon the sequence of either strand of DNA
encoding the spidroin 1 gene product. Exemplary oligonucleotide probes are degenerate
probes (i.e., a mixture of all possible coding sequences for the N. clavipes spidroin 1
protein).
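
To make the notion of a degenerate probe concrete, the following Python sketch enumerates every DNA sequence that could encode a short peptide under the standard genetic code. It is a minimal illustration rather than a probe-design procedure: the function names are hypothetical, and the six-residue window QGGYGG (residues 3 to 8 of SEQ ID NO: 2) is chosen here only as an example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position
# ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid to the list of codons that encode it.
CODONS_FOR = {}
for amino_acid, codon in zip(AMINO_ACIDS, product(BASES, repeat=3)):
    CODONS_FOR.setdefault(amino_acid, []).append("".join(codon))


def degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for the peptide."""
    count = 1
    for residue in peptide:
        count *= len(CODONS_FOR[residue])
    return count


def degenerate_probes(peptide: str) -> list:
    """Every coding sequence for the peptide (the full degenerate mixture)."""
    return ["".join(codons)
            for codons in product(*(CODONS_FOR[r] for r in peptide))]


if __name__ == "__main__":
    window = "QGGYGG"  # illustrative window taken from SEQ ID NO: 2
    print(degeneracy(window), "sequences of", 3 * len(window), "nucleotides")
```

Because glycine has four codons and glutamine and tyrosine have two each, this six-residue window corresponds to 2 x 4 x 4 x 2 x 4 x 4 = 1024 distinct 18-nucleotide sequences, which shows why short, low-degeneracy windows are usually chosen in practice.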

II. Spider silk formation and spinning

Silks or fibroin monomers are produced within specialized glands in the abdomen
of the spider and are retained in a lumen at a concentration of 20-30%. This material will
polymerize when tension is applied in the presence of drying. As the silk is very thin, the
drying process is rapid. Silks are composed of alternating crystalline and amorphous
regions. The dominant crystals are β-pleated sheet crystals that are arranged parallel to
the long axis of the silk fiber. In Bombyx mori (silkworms), the crystal-forming domain
is a six-residue peptide (GAGAGS (SEQ ID NO: 1)) domain which alternates with amorphous
domains.

The amino acids G, A, and S make up greater than 85% of silkworms' cocoon silk
fibroin. Silk's alternating Ala or Ser and Gly residues extend to opposite sides of a given
β-sheet so that the Ala side chains extending from one β-sheet efficiently nestle between
those of the neighboring sheet, and likewise for the Gly side chains. Gly side chains from
neighboring β-sheets are in contact, as are those of Ala and Ser. The inter-sheet spacings,
consequently, have the alternating values 3.5 angstroms and 5.7 angstroms, as determined
from X-ray diffraction studies of Nephila dragline silk. This spacing is identical to that of
synthetic poly-L-alanine peptides in their β-sheet conformation. X-ray diffraction studies
of Araneus diadematus dragline silk indicate a larger inter-sheet spacing of about 7.5
angstroms (Gosline et al., Biomimetics: Design and Processing of Materials, pp. 237-261,
eds. Sarikaya and Aksay, Amer. Inst. of Physics, 1995).

In Nephila, the major ampullae gland (MA gland) produces the dragline silk. The
C-terminal portions of the Nephila MA gland-produced genes cloned so far (spidroin 1
and spidroin 2) have the following sequences: a 34 amino acid long repeat motif (forming
both the amorphous domain and crystal-forming domain); and a 47 amino acid long
consensus sequence (Xu and Lewis, Proc. Natl. Acad. Sci. U.S.A. 87: 7120-7124, 1990;
Beckwitt and Arcidiacono, J. Biol. Chem. 269: 6661-6663, 1994).

The 34 amino acid long repeat motif (forming both the amorphous domain and
crystal-forming domain) of Nephila spidroin 1 has the following sequence:

AGQ GGY GGL GSQ GAG RGG LGG QGA GAA AAA AAG G (SEQ ID NO: 2). 
Note the poly-alanine region (AAAAAAA) at the C-terminal end of SEQ ID NO: 2, as 
well as numerous glycine blocks forming amorphous domains (two glycine residues 
separated by 3 amino acids; e.g., GG LGG). 
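
The features noted above can be located programmatically. The following Python sketch is offered only as an illustration: it scans SEQ ID NO: 2 for poly-alanine runs and reports the combined glycine and alanine content. The minimum run length of four residues is an assumption made for this example and is not a threshold taken from this description; the function names are likewise hypothetical.

```python
import re

SEQ_ID_NO_2 = "AGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAAGG"  # 34-residue repeat motif


def poly_alanine_runs(protein: str, min_length: int = 4) -> list:
    """Return (start, length) for each run of at least min_length alanines.

    Start positions are 0-based. min_length is an illustrative assumption.
    """
    pattern = re.compile("A{%d,}" % min_length)
    return [(m.start(), len(m.group())) for m in pattern.finditer(protein)]


def residue_fraction(protein: str, residues: str) -> float:
    """Fraction of the protein made up of the given one-letter residues."""
    return sum(protein.count(r) for r in residues) / len(protein)


if __name__ == "__main__":
    print(poly_alanine_runs(SEQ_ID_NO_2))
    print(round(residue_fraction(SEQ_ID_NO_2, "GA"), 2))
```

For SEQ ID NO: 2 the scan finds a single run of seven alanines beginning at 0-based position 25, immediately before the two C-terminal glycines, and glycine plus alanine account for 26 of the 34 residues.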

The 47 amino acid long consensus sequence of Nephila spidroin 2 has the
following sequence:

CPG GYG PGQ QCP GGY GPG QQC PGG YGP GQQ GPS GPG SAA AAA 
AAA AA (SEQ ID NO: 3). 

The silk is secreted within individual glands; when the secretion is subjected to
shear forces and mechanical extension, the poly-alanine (crystal-forming) segments
undergo a helix to β-sheet transition, forming β-crystals that stabilize the silk's structure. The
glycine blocks are designated as portions forming the amorphous polypeptide chains
interspersed among the crystalline regions.

Biofilaments have evolved in certain insects and arachnids having very specialized
anatomical adaptations and gene evolution. In spiders, the silk is produced by a series of
abdominal glands. The formation and size of the crystal may depend upon, among other
things, the primary amino acid sequence composition. The production and secretion of
the major silks occurs in the major ampullae (MA) gland, the flagelliform (FL) gland, and
the cylindrical (CY) gland, which produce dragline silk, viscid silk, and cocoon silk,
respectively. It will be understood that the transgenic animals described herein, and the
constructs used to generate such animals, may produce any of these biofilaments, or any
variations thereof, such that a biofilament is produced having inter-β-sheet spacings of
between 3.5 and 7.5 angstroms.

A. Dragline Silk. 

The MA gland produces two proteins, at a 3:2 ratio, which are rich in glycine,
alanine, and proline. These proteins form the dragline silk, which is the lifeline, the
scaffolding silk, and the frame for spider webs. Dragline silk is a high stiffness fiber and
has properties similar to nylon. Dragline silk contains 20-30% crystal, by volume, and
has the following characteristics: stiff (initial Young's modulus is 10 GPa), strong (tensile
strength is 1.5 GPa), and tough (energy required to break is 150 MJ m^-3).

B. Viscid Silk 

The viscid silk produced by the FL gland forms the sticky spiral of the web. No 
genes encoding viscid silk have been identified as yet. Viscid silk contains less than 5% 
crystal by volume, is elastomeric in its native state, and has properties similar to Lycra. It 
has the following characteristics: mechanically similar to lightly cross-linked rubber (e.g.,
spandex), low stiffness (initial Young's modulus is 3 MPa), and highly extensible.

C. Cocoon Silk

The CY gland produces cocoon silk that is similar to the cocoon silk produced by
silkworms (B. mori).

III. Synthesis of biofilament genes

Because of the difficulties encountered in cloning native silk genes, synthetic 
genes can be cloned and expressed. The cDNA sequences cloned to date share 
similarities in overall organization and in regions of sequence conservation. The 
consensus repeats are rich in glycine and glutamine, with poly(Ala) regions integrated
into larger repeating units.

For discussion purposes, the candidate gene will be represented by one of the
major dragline genes from N. clavipes, spidroin 1 (Arcidiacono et al., Appl. Microbiol.
Biotechnol. 49: 31-38, 1998). The highly repetitive nature of these genes raises
concerns over the stability of the genes and the possibility of recombination. The repeats
can be avoided based on suggestions offered, for example, by Arcidiacono et al. (Appl.
Microbiol. Biotechnol. 49: 31-38, 1998) and Fahnestock and Irwin (supra). Furthermore, a
series of constructs can be generated using a similar strategy (Fahnestock and Irwin,
Appl. Microbiol. Biotechnol. 47: 23-32, 1997), generating 4-20 (or more) consecutive
repeats, and can be tested in cell lines prior to the generation of transgenic animals.
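
The size of such a construct series can be estimated with a short calculation. The following Python sketch is hypothetical: it assumes, purely for illustration, that the repeat unit is the 34 amino acid motif of SEQ ID NO: 2 joined head to tail with no linker, and the copy numbers 8, 12, and 16 are arbitrary points within the 4-20 range mentioned above. The function names are not taken from any cited reference.

```python
SEQ_ID_NO_2 = "AGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAAGG"  # assumed repeat unit


def head_to_tail(repeat_unit: str, copies: int) -> str:
    """Concatenate the repeat unit head to tail the requested number of times."""
    if copies < 1:
        raise ValueError("at least one copy is required")
    return repeat_unit * copies


def construct_series(repeat_unit: str, copy_numbers=(4, 8, 12, 16, 20)) -> dict:
    """Build one multimer per copy number (copy numbers are illustrative)."""
    return {n: head_to_tail(repeat_unit, n) for n in copy_numbers}


if __name__ == "__main__":
    for copies, protein in construct_series(SEQ_ID_NO_2).items():
        # Three nucleotides encode each residue, ignoring signal sequence and stop codon.
        print(copies, "repeats:", len(protein), "residues,",
              3 * len(protein), "coding nucleotides")
```

Under these assumptions, 4 repeats give a 136-residue product and 20 repeats give a 680-residue product, which indicates the range of insert sizes that would be tested in cell lines.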

Blocks of synthetic repeats are constructed so they have different sizes and contain non-
coding sequences (e.g., introns from casein or immunoglobulin genes) in order to
facilitate transcription of the encoded biofilaments and enhance expression. The blocks
can be alternated using a head-to-tail construction strategy (McGrath, K. P., Ph.D.
Dissertation, University of Massachusetts at Amherst, 1991; Ferrari et al., U.S.P.N.
5,243,038).

Codon selection may also be designed to maximize expression, since premature
termination may occur if the gene contains a greater number of codons recognized by
tRNA species present in lower abundance in the cell (Rosenberg et al., J. Bacteriol. 175:
716-722, 1993; Manley, J. Mol. Biol. 125: 407-432, 1978). The genes to be expressed are
designed and synthesized using codons favored in the tissue of expression (e.g., casein
genes in the mammary gland; uroplakins in the bladder).
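
Codon selection of this kind can be sketched as a simple back-translation. In the following Python example the preferred-codon table is a hypothetical placeholder: each codon does encode the residue shown under the standard genetic code, but the choice among synonymous codons is invented for illustration and is not derived from casein or uroplakin codon usage, which would have to be tabulated from the genes of the target tissue.

```python
# HYPOTHETICAL preferred codons (placeholders, not measured tissue preferences).
PREFERRED_CODON = {
    "A": "GCC",
    "G": "GGC",
    "Q": "CAG",
    "Y": "TAC",
    "L": "CTG",
    "S": "AGC",
    "R": "CGG",
}


def back_translate(protein: str, preferred: dict = PREFERRED_CODON) -> str:
    """Encode a protein using one preferred codon per amino acid."""
    missing = sorted(set(protein) - set(preferred))
    if missing:
        raise KeyError("no preferred codon listed for: " + ", ".join(missing))
    return "".join(preferred[residue] for residue in protein)


if __name__ == "__main__":
    # GAGAGS is the B. mori crystal-forming hexapeptide (SEQ ID NO: 1).
    print(back_translate("GAGAGS"))
```

Using a single codon for every occurrence of a residue is the simplest possible policy. For a highly repetitive gene it also makes the DNA itself more repetitive, so a practical design might alternate among several well-used synonymous codons; that refinement is not shown here.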

Given the high frequency of alanine residues in biofilament proteins, it may be
desirable to supplement the cell culture media of the cell line with additional Ala
and/or Gly amino acids so that depletion of the Ala tRNA and/or Gly tRNA pools does not
become the rate-limiting step in generating the biofilament protein.

IV. Assembly of Expression Vectors 

Eukaryotic expression vectors may be generated which drive the synthesis and
secretion of proteins (e.g., biofilament proteins) in the milk or urine of an animal
transgenic for a nucleic acid molecule encoding such a protein. These vectors are
prepared according to standard molecular biology techniques.

The synthesized nucleic acid molecule(s) may have a sequence encoding an
epitope tag attached for easy identification and/or purification of the encoded polypeptide.
Such purification may be accomplished, for example, by affinity chromatography for the
epitope tag. A site-specific proteolytic or chemical agent recognition site may be added
to the sequence to facilitate removal of the epitope tag following purification of epitope-
tagged polypeptides (Saito et al., J. Biochem. 101: 123-134, 1987). Preferably, the
site-specific proteolytic or chemical agents cleave at or near the junction of the epitope
tag and the biofilament protein.

A variety of chemical cleavage agents and their recognition sites (in single letter
code) are known in the art, and include the following: hydroxylamine (N or G); formic
acid (D or P); cyanogen bromide (M); and acetic acid (D or P). For example, cyanogen
bromide (CNBr) may be used to cleave at a Met (M) residue introduced between the epitope
tag and the biofilament protein.

Alternatively, natural or synthetic proteases may be used. Examples of these (and
their recognition sites) include enterokinase (DDDK; SEQ ID NO: 5); Factor Xa (IEGR;
SEQ ID NO: 6); chymotrypsin (W and Y and F); renin (YIHPFHLL; SEQ ID NO: 7);
trypsin (R and K); and thrombin (RGPR; SEQ ID NO: 8). For example, the epitope tag
may be attached to the biofilament via a thrombin-recognition site. Following affinity
purification of epitope tag-containing proteins, since biofilaments are generally resistant
to proteolysis, the epitope tag may be easily removed upon proteolytic cleavage with
thrombin.
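
Whether a chosen recognition site is unique to the tag junction can be checked by scanning the fusion protein. The following Python sketch is illustrative: the recognition sequences are those listed above, while the six-histidine epitope tag and the use of a single SEQ ID NO: 2 repeat as the biofilament portion are assumptions introduced only for this example.

```python
# Multi-residue recognition sequences listed in this description.
RECOGNITION_SITES = {
    "enterokinase": "DDDK",  # SEQ ID NO: 5
    "Factor Xa": "IEGR",     # SEQ ID NO: 6
    "renin": "YIHPFHLL",     # SEQ ID NO: 7
    "thrombin": "RGPR",      # SEQ ID NO: 8
}


def find_sites(protein: str) -> dict:
    """Return the 0-based start positions of each recognition sequence."""
    hits = {}
    for name, site in RECOGNITION_SITES.items():
        hits[name] = [i for i in range(len(protein) - len(site) + 1)
                      if protein.startswith(site, i)]
    return hits


EPITOPE_TAG = "HHHHHH"  # hypothetical tag, chosen only for illustration
BIOFILAMENT = "AGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAAGG"  # one repeat of SEQ ID NO: 2

if __name__ == "__main__":
    fusion = EPITOPE_TAG + RECOGNITION_SITES["thrombin"] + BIOFILAMENT
    print(find_sites(fusion))
```

In this example the thrombin site occurs once, at the tag junction, and none of the four recognition sequences occurs within the repeat motif itself, which is the situation the paragraph above describes.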

The expression cassette consists of elements necessary for proper transcription,
translation, and secretion in the desired eukaryotic cell (i.e., a promoter, signal sequence
for secretion, intron sequences, and a polyadenylation signal). Many milk- or urine-
specific promoters can be used with their signal sequences or with the silk and/or fibroin
gene signal sequence. In the former case, the biofilament-encoding nucleic acid molecule
should not contain its own translation initiation codon but, rather, should be in frame with
the 3' end of the signal sequence. The 3' end of the biofilament-encoding nucleic acid
molecule may contain its own polyadenylation signal, or may contain the 3' untranslated
sequence normally found on the gene used for the promoter and/or signal sequence. For
example, a biofilament-encoding nucleic acid molecule may be in an expression vector
cassette with a promoter, signal sequence, and 3' untranslated sequence all from the same
casein gene.
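
The reading-frame requirement described above can be expressed as a few simple checks. The following Python sketch is a hypothetical illustration: the class name, field names, and all sequences are placeholders invented for the example, the intron is omitted for brevity, and the assumption that the signal sequence supplies the ATG initiation codon is stated here rather than taken from a specific construct.

```python
from dataclasses import dataclass


@dataclass
class ExpressionCassette:
    """Hypothetical model of a cassette; all sequences are placeholders."""
    promoter: str
    signal_sequence: str    # coding DNA, assumed to begin with ATG
    biofilament_cds: str    # biofilament-encoding insert
    three_prime_utr: str
    heterologous_signal: bool = True  # signal taken from the promoter's gene

    def problems(self) -> list:
        """List reasons the insert would not be read in frame as intended."""
        issues = []
        if not self.signal_sequence.upper().startswith("ATG"):
            issues.append("signal sequence lacks an initiation codon")
        if len(self.signal_sequence) % 3:
            issues.append("signal sequence length is not a multiple of 3")
        if len(self.biofilament_cds) % 3:
            issues.append("biofilament insert length is not a multiple of 3")
        if self.heterologous_signal and self.biofilament_cds.upper().startswith("ATG"):
            issues.append("biofilament insert carries its own initiation codon")
        return issues

    def assemble(self) -> str:
        """Join the elements 5' to 3' once all checks pass."""
        issues = self.problems()
        if issues:
            raise ValueError("; ".join(issues))
        return (self.promoter + self.signal_sequence
                + self.biofilament_cds + self.three_prime_utr)


if __name__ == "__main__":
    cassette = ExpressionCassette(
        promoter="TATAAA",            # placeholder, not a real casein promoter
        signal_sequence="ATGAAGGTC",  # placeholder signal-sequence coding DNA
        biofilament_cds="GGCGCCGGC",  # placeholder insert encoding G-A-G
        three_prime_utr="AATAAA",     # placeholder containing a poly(A) signal
    )
    print(cassette.problems(), cassette.assemble())
```

The checks mirror the text: when a heterologous signal sequence is used, the insert must begin directly with biofilament codons, and both coding segments must be whole numbers of codons so that the insert stays in frame with the 3' end of the signal sequence.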
The eukaryotic expression constructs to be used may include one or more of the
following basic components.

A) Promoter or transcriptional initiation regulatory region

These sequences may be heterologous to the cell to be modified and may include
synthetic and natural viral sequences (e.g., human cytomegalovirus immediate early
promoter (CMV); simian virus 40 early promoter (SV40); Rous sarcoma virus (RSV); or
adenovirus major late promoter) which confer a strong level of transcription of the
nucleic acid molecule to which they are operably linked. The promoter can also be
modified by the deletion of nonessential sequences and/or addition of sequences, such as
enhancers (e.g., an enhancer element from CMV, SV40, or RSV), or tandem repeats of such
sequences. The addition of strong enhancer elements may increase transcription by 10-
to 100-fold. Expression from the above-identified viral promoters is constitutive (i.e.,
expression occurs in the absence of an apparent external stimulus).

Alternatively, for expression in the milk, for example, the promoter region may be
native to a ruminant mammary-specific gene. Examples include: αS1-casein (PCT
Application Nos.: WO91/08216 and WO93/25567), αS2-casein, β-casein (Rosen, J.M.,
U.S.P.N. 5,304,489; Lee et al., Nucleic Acids Res. 16: 1027-1041, 1988), κ-casein,
β-lactoglobulin, and α-lactalbumin (Vilotte et al., Eur. J. Biochem. 186: 43-48, 1989; PCT
Application No.: WO88/01648). These promoters can drive a high level of expression of
a variety of proteins in a tissue and lactation specific manner.

B) Intron Inclusion 

Genes containing an intron (i.e., genomic clones) are expressed at higher levels 
than intron-less genes. Hence, inclusion of an intron placed between the transcription 
initiation site and the translational start codon, 3' to the translational stop codon, or inside 
the coding region of the biofilament-encoding gene may result in a higher level of 
expression. 

The intron sequence includes a 5' splice site (donor site) and a 3' splice site 
(acceptor site). Sequences of at least 100 base pairs are found between these two sites. 
These intronic sequences will be derived from the promoter being used, or 
from the native gene (Ichimura and Mita, J. Mol. Evol. 35: 123-130, 1992), and positioned 
5' to the coding sequence of the silk or fibroin gene. Since the highly repetitive nature of 
the construct raises concerns over the stability of the gene and the possibility of 
recombination, the repetitive sequences can be disrupted by inserting the 
introns of the gene from which the promoter is taken. The introns can be positioned in a 
manner similar to those present in the fibroin gene of Bombyx mori (Tsujimoto and 
Suzuki, Cell 18: 591-600, 1979; Tsujimoto and Suzuki, Cell 16: 425-436, 1979). This 
strategy will allow for increased levels of expression in addition to increased stability of 
the gene. 

C) Signal (Leader) Sequences 

Each expression vector will contain a signal sequence which directs the expressed 
gene product to be secreted from the mammary or uroepithelial cells. A signal 
sequence is present in any gene whose product is secreted naturally. A signal sequence from a 
related fibroin gene (e.g., B. mori heavy and light fibroin gene, P25 and ssp160), from a 
gene specific to the tissue of expression (e.g., casein or uroplakin gene), or a general 
signal sequence (e.g., from human alkaline phosphatase, melittin, and CD33 signal 
peptides) may be used. Furthermore, the signal sequences for secretion may be 
interchanged between mammalian/insect genes; for example, signal sequences from 
melittin, casein, or the sequences from native silk (from arachnids) or fibroin (from 
insects) genes may be used. 

D) Termination Region 

The transcription termination region of the nucleic acid constructs may involve the 
3'-end and polyadenylation signal of the gene from which the 5'-promoter region is derived 
(for example, the bovine αS1-casein gene). Alternatively, the 3'-end of the nucleic acid 
construct will contain transcription termination and polyadenylation signals which are 
known to regulate post-transcriptional mRNA stability, such as those derived from bovine 
growth hormone, β-globin genes, or the SV40 early region. 

E) Other features of the expression vectors 

The expression vectors designed for gene transfer also contain an origin of 
replication for propagation in E. coli, an SV40 origin of replication, an ampicillin resistance 
gene, a neomycin resistance gene for selection in eukaryotic cells, and/or genes (i.e., 
dihydrofolate reductase gene) that amplify the dominant selectable marker plus the gene 
of interest. In addition, the expression vectors may contain appropriate flanking 
sequences at their 5' and 3' ends that will allow for enhanced integration rates in the 
transduced cells (ITR sequences; Lebkowski et al., Mol. Cell. Biol. 8: 3988-3996, 1988). 
Furthermore, prolonged expression of the silk or fibroin protein in vitro may be achieved 
by the use of sequences (i.e., EBNA-1 and oriP from the Epstein-Barr virus) that allow 
for autonomous replication (the transduced, circular nucleic acid replicates as a plasmid 
in mammalian cells). 

To clone and propagate large segments of DNA, cosmids are the vectors of choice. 
Although plasmid vectors can, in theory, carry large inserts, the resulting recombinants 
transform Escherichia coli very inefficiently. Cosmids have the capacity to propagate 
large pieces of foreign DNA (Royal et al., Nature 279: 125, 1979). 

The expression vectors used for the generation of transgenic animals may be 
linearized by restriction endonuclease digestion prior to transformation of a cell. In a 
variant of this method, only a digestion fragment that includes the coding, 5'-end 
regulatory sequences (e.g., the promoter), and 3'-end regulatory sequences (e.g., the 3' 
untranslated region) from, for example, bovine casein or growth hormone sequences, will 
be used to transform cells. A cell transformed with such a fragment will not, 
consequently, contain any sequences that are necessary solely for plasmid propagation in 
bacteria (e.g., the cell will not contain the E. coli origin of replication or a nucleic acid 
molecule encoding an antibiotic-resistance protein (e.g., an ampicillin-resistance protein) 
that is useful for selecting prokaryotic cells). 

In another variant of this method, the digestion fragment used to transform a cell 
will include the coding region, the 5' and 3' regulatory sequences, and a nucleic acid 
molecule (including a promoter and 3' untranslated region) encoding a protein capable of 
conferring resistance to an antibiotic useful for selecting eukaryotic cells (e.g., neomycin 
or puromycin). 

The biofilament gene of interest may be modified in its 5' untranslated region 
(UTR), its 3' UTR, and/or its region coding for the N-terminus in order to 
improve expression. Alternatively, sequences within the coding sequence of the 
biofilament-encoding nucleic acid molecule may be deleted or mutated in order to 
increase secretion and/or avoid retention of the gene product within the cell due, for 
example, to the presence of endoplasmic reticulum (ER) retention signals or other sorting 
inhibitory signals. Furthermore, the transgenic construct may contain sequences that 
possess chromatin opening domain activity such that they confer reproducible activation 
of tissue-specific expression of a linked transgene (Ellis et al., PCT Application No.: 
WO95/33841; Chung and Felsenfeld, PCT Application No.: WO96/04390). 

V. Testing and Characterization of Expression Vectors 

The assembled constructs are partially characterized by sequencing the areas 
where two pieces of DNA have been fused together by ligation. A partial restriction 
endonuclease map will provide information on the correct orientation of the ligated 
regulatory sequences with respect to the biofilament-encoding nucleic acid molecule. 
Further characterization of the functionality of the assembled constructs includes 
their transfection into established mammary epithelial cell lines (e.g., MAC-Ts) (see 
Turner and Huynh, U.S.P.N. 5,227,301; Huynh et al., Exp. Cell. Res. 197: 191-199, 1991; 
Stampfer et al., U.S.P.N. 4,423,145) and identification of the secreted product. 
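
The way a restriction map reveals insert orientation can be illustrated with a toy digest. The sketch below is an assumption-laden illustration, not a protocol from this disclosure: the sequences are artificial homopolymer stretches, EcoRI (GAATTC) is used only as a familiar example enzyme, and the cut is placed at the start of the recognition site for simplicity.

```python
# Toy model: fragment sizes from a linear digest differ with insert orientation
# when the insert carries an off-center recognition site.

ECORI = "GAATTC"  # palindromic recognition site, used here only as an example


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def digest_linear(seq, site):
    """Return fragment lengths after cutting a linear sequence at every site.

    Simplification: the cut is placed at the first base of the site.
    """
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Artificial construct: one site in the left arm, one off-center in the insert.
LEFT = "A" * 100 + ECORI + "A" * 50
INSERT = "C" * 30 + ECORI + "C" * 200
RIGHT = "A" * 80

forward = LEFT + INSERT + RIGHT
reverse = LEFT + reverse_complement(INSERT) + RIGHT

print(digest_linear(forward, ECORI))  # [100, 86, 286]
print(digest_linear(reverse, ECORI))  # [100, 256, 116]
```

The two orientations give different fragment patterns, which is the information a partial restriction map is being used to obtain.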

The recombinant DNA methods employed in practicing the present invention are 
standard procedures that are well known to those skilled in the art of molecular biology, 
and are described in detail in, for example, Sambrook, Fritsch and Maniatis, Molecular 
Cloning: A Laboratory Manual (2nd ed.), Cold Spring Harbor Press, 1989; Ausubel et al., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1994; and 
Perbal, B.V., A Practical Guide to Molecular Cloning (2nd ed.), John Wiley & Sons, New 
York, NY, 1988. For purposes of explanation only, the biofilament gene used in the 
following examples is the 1.5 kb NcDS-1 insert described and cloned by Arcidiacono et 
al., Appl. Microbiol. Biotechnol. 49: 31-38, 1998. 

The following examples are meant to illustrate, and do not limit, the invention. 

Example 1: The Casein Promoter 

In the following example, the design of the construct includes the use of the goat 
β-casein promoter (Ebert et al., Bio/Technology 12: 699-701, 1993), followed by its own 
signal sequence for expression, followed by a 1.5 kb insert containing the silk clone 
(Arcidiacono et al., supra) in frame with the 5' and 3' ends of the casein gene. A 
schematic diagram of this construct is shown in Fig. 1A. 

The nucleic acid molecule encoding a biofilament (e.g., a silk or fibroin gene, or 
fragment thereof) is fused to the casein promoter and secretion of the biofilament protein 
is driven by signal sequences from the gene from which the promoter is derived, or from 
the biofilament nucleic acid molecule to be expressed. Termination sequences can be 
derived from the biofilament gene itself, or from the promoter gene. Furthermore, a 
hybrid gene can be created to increase the level of expression. For this purpose, the silk 
or fibroin gene (or fragment thereof) can be inserted between exon 2 (just upstream of 
the ATG) and exon 7 (downstream of the stop codon) of the goat β-casein gene (Ebert et 
al., Bio/Technology 12: 699-701, 1993). Since the highly repetitive nature of the 
construct raises concerns over the stability of the gene and the possibility of 
recombination, the repetitive sequences can be disrupted by inserting the 
introns from the casein gene (introns 3 to 7). Because of the large size of the final 
construct, construction of the vector can be performed in a cosmid vector (supercos). 
Alternatively, the backbone could consist of any known bacterial vector (preferably one 
that accepts large DNA inserts) which contains sequences necessary for its amplification 
in an E. coli host (Sambrook et al., supra). 

Example 2: The Whey Acidic Protein (WAP) Construct 

The present example demonstrates the generation of a hybrid gene composed of 
the WAP gene promoter, its signal sequence, a 1.5 kb cDNA encoding dragline silk 
(Arcidiacono et al., supra), followed by the 3' end of the WAP gene (Velander et al., 
Proc. Natl. Acad. Sci. U.S.A. 89: 12003-12007, 1992). The details of its construction are 
described below and diagrammed in Fig. 1B. 

Whey acidic protein (WAP), the major whey protein in rodents, is expressed at 
high levels exclusively in the mammary gland during late pregnancy and lactation (Hobbs 
et al., J. Biol. Chem. 257: 3598-3605, 1982). The genomic murine WAP gene consists of 
2.6 kbp (kilo base pairs) of 5' flanking promoter sequence, 3.0 kbp of coding sequence 
(exons and introns), and 1.6 kbp of 3' flanking DNA (see Fig. 1B) (Velander et al., Proc. 
Natl. Acad. Sci. U.S.A. 89: 12003-12007, 1992; Velander et al., Ann. N. Y. Acad. Sci. 
665: 391-403, 1992). 

In one example, a hybrid gene composed of the WAP gene promoter and a cDNA 
encoding dragline silk may be constructed. The hybrid dragline-silk encoding gene is 
created by inserting the gene(s) or nucleic acid molecule(s) of interest (in this case, a 
cDNA encoding dragline silk) between the murine WAP promoter and 5' sequence 
(nucleotide position -949 to +33) at the 5' end and the WAP 3' sequence at the 3' end (843 
bp; a portion of exon 3 (the last 30 bases), all of intron C, exon 4, and 70 bp of 3' UTR) 
(Campbell et al., Nucleic Acids Res. 12: 8685-8697, 1984). In this example, the signal 
sequence comes from the native silk gene. One can also use the WAP gene signal 
sequence by including an additional 56 bp at the 5' end (position -949 to +89). 
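
The coordinates quoted above can be cross-checked with simple arithmetic. The sketch below assumes the common numbering convention in which +1 is the transcription start site and there is no position 0; that convention, and the helper name `span_length`, are assumptions made here for illustration and are not stated in the text.

```python
# Cross-check of promoter fragment lengths from start/end coordinates.
# Assumed convention: +1 is the transcription start site; there is no position 0.


def span_length(start, end):
    """Length in bp of the inclusive interval start..end, skipping position 0."""
    if start == 0 or end == 0 or start > end:
        raise ValueError("coordinates must be non-zero with start <= end")
    length = end - start + 1
    if start < 0 < end:
        length -= 1  # no nucleotide is numbered 0
    return length


short_fragment = span_length(-949, 33)  # promoter plus 5' sequence
long_fragment = span_length(-949, 89)   # same, extended to include the WAP signal

print(short_fragment, long_fragment, long_fragment - short_fragment)
```

Under this convention the two fragments are 982 bp and 1038 bp long, and the difference is the 56 bp mentioned in the text. The difference does not depend on how position 0 is handled.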

In another example, the hybrid gene is created by inserting nucleotide sequence 
encoding part of the 5' untranslated region and a 19 amino acid signal sequence from the 
murine WAP gene (nucleotides +1 to +90) (Hennighausen et al., Nucl. Acids Res. 10: 
3733-3744, 1982), and by amplifying the 1.5 kb (kilo base) silk gene (NcDS-1; 
Arcidiacono et al., supra) using 5' primers containing the 90 bp (base pair) sequence 
encoding the signal sequence flanked by a KpnI restriction endonuclease recognition site, 
and 3' primers creating a KpnI restriction endonuclease recognition site at the 3' end of 
the gene. The amplification is performed so as to maintain the correct reading frame. This 
PCR product is then inserted at the KpnI site in the first exon of WAP. The hybrid gene can 
be cut out of the vector by digestion with the EcoRI restriction endonuclease (see Fig. 
1B), and purified for microinjection. 
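
The primer strategy described above can be sketched in code. This is a hedged illustration only: the signal and gene sequences are short hypothetical placeholders (not the WAP signal or the silk cDNA), and the 2-nt "CG" clamp, the 18-nt annealing length, and the function names are assumptions introduced here. Only the KpnI recognition sequence (GGTACC) is a known fact.

```python
# Sketch: primers that add KpnI sites and a 5' signal sequence to an insert.
# Sequences, clamp and annealing length are hypothetical placeholders.

KPNI = "GGTACC"

SIGNAL = "ATG" + "GCT" * 5                  # placeholder, 18 nt
GENE = "GGTGGCGCTGGTCAAGGTGGCTAT" * 3       # placeholder repeat, 72 nt


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def design_primers(signal, gene, anneal=18, clamp="CG"):
    """Return (forward, reverse) primers, each written 5' to 3'."""
    if KPNI in signal + gene:
        raise ValueError("internal KpnI site would be cut during cloning")
    forward = clamp + KPNI + signal + gene[:anneal]
    reverse = clamp + KPNI + reverse_complement(gene[-anneal:])
    return forward, reverse


def expected_product(signal, gene, clamp="CG"):
    """Top strand of the PCR product implied by the two primers."""
    return clamp + KPNI + signal + gene + KPNI + reverse_complement(clamp)


fwd, rev = design_primers(SIGNAL, GENE)
product = expected_product(SIGNAL, GENE)

# The coding portion must stay a multiple of 3 to preserve the reading frame.
in_frame = (len(SIGNAL) + len(GENE)) % 3 == 0
print(fwd, rev, in_frame)
```

Because the KpnI site is itself 6 nt long, it does not shift the frame; whether the fusion is in frame with the surrounding WAP exon still depends on the exact position of the KpnI site in that exon, which this sketch does not model.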

Example 3: The Uroplakin Promoter 

In these experiments, the uroplakin II promoter (Lin et al., Proc. Natl. Acad. Sci. 
U.S.A. 92: 679-683, 1995; Sun, T., PCT Application No.: WO96/39494) is used to drive 
the expression of the fibroin or silk gene(s) in the urothelium of transgenic animals. The 
fibroin or silk gene(s) is inserted downstream of a 3.6 kb 5' flanking sequence of the 
mouse uroplakin II (UPII) gene (Lin et al., Proc. Natl. Acad. Sci. U.S.A. 92: 679-683, 
1995). A sequence containing part of exon 1 and all of intron 1 and exon 2 of the mouse 
protamine-1 (Mp-1) gene (Peschon et al., Proc. Natl. Acad. Sci. U.S.A. 84: 5416, 1987) is 
placed at the 3' end of the gene to provide an exon/intron splicing site and a 
polyadenylation signal. One can also use the signal sequence of the uroplakin II gene by 
inserting the fibroin or silk-encoding gene, or fragment thereof (mature protein only), in 
frame to amino acid 59 of exon 3. A diagram of this construct is shown in Fig. 1C. 

Example 4: DNA Shuffling 

Another method for producing a biofilament-encoding gene cassette is by DNA 
shuffling. DNA shuffling is a process for recombination and mutation, performed by 
random fragmentation of a pool of related genes, followed by reassembly of the 
fragments by primerless PCR (Stemmer, W. P., Nature 370: 389-391, 1994; Stemmer, W. 
P., Proc. Natl. Acad. Sci. U.S.A. 91: 10747-10751, 1994). The goal of DNA shuffling is 
to optimize the function of genes without first determining which gene product is rate 
limiting. The genes that are "shuffled" can be from the same species or from different 
species (i.e., spidroin 1 and 2 and ADF 1-4). Furthermore, different repeats (e.g., GGAGQGGY (SEQ 
ID NO: 4) from ADF 3) within one species can be "shuffled" (with, for example, repeats 
from ADF 1, 2, and 4) and expressed to determine the combination which yields 
favorable characteristics. In addition to point mutation, diversity is generated using a 
wide variety of mutational mechanisms, such as polynucleotide deletion, insertion, and 
inversion, as well as integration and excision. Once thus generated, the biofilament- 
encoding nucleic acid molecule may be inserted into an expression cassette for secretion 
in milk or urine, and used to transform mammalian cells. 
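
The recombination outcome of DNA shuffling can be pictured with a toy simulation. The sketch below is a deliberate simplification and not the laboratory procedure: it assumes aligned, equal-length parental sequences and models reassembly as random crossovers between parents, ignoring fragmentation chemistry, homology-dependent priming, and point mutation. The function name and the homopolymer parents are hypothetical.

```python
# Toy model of DNA shuffling: chimeras built by random crossovers between
# aligned parental sequences. Real shuffling relies on fragmentation and
# homology-driven primerless PCR, which this sketch does not simulate.
import random


def shuffle_parents(parents, n_crossovers, rng):
    """Return one chimeric sequence assembled from segments of the parents."""
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("this toy model needs equal-length, aligned parents")
    # Choose distinct crossover positions strictly inside the sequence.
    points = sorted(rng.sample(range(1, length), n_crossovers))
    bounds = [0] + points + [length]
    pieces = []
    for start, end in zip(bounds, bounds[1:]):
        donor = rng.choice(parents)  # each segment comes from a random parent
        pieces.append(donor[start:end])
    return "".join(pieces)


# Hypothetical parents; homopolymers make the origin of each segment visible.
PARENTS = ["A" * 40, "C" * 40, "G" * 40]
rng = random.Random(0)  # fixed seed so that runs are repeatable
library = [shuffle_parents(PARENTS, 4, rng) for _ in range(5)]
for chimera in library:
    print(chimera)
```

Each printed chimera has the parental length and consists of blocks drawn from the three parents; in practice such a library would then be screened for the combination that yields favorable characteristics.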

Example 5: Constructs Expressing Two Genes 

For the expression of two genes (e.g., dragline silk consists of a ratio of ADF3 and 
ADF4), either two separate constructs can be generated, or both genes can be cloned into the 
same expression cassette with the insertion of an internal ribosome entry site (IRES) 
(Jang et al., J. Virol. 62: 2636-2643, 1988). The advantage of cloning both genes into the 
same expression cassette is that only a single construct is needed to generate the 
transgenic animal. 

Example 6: Synthesis of Chimeric or Hybrid Biofilaments 

Chimeric molecules are synthesized that include a fusion of the silk (containing 
either crystal or amorphous domains) in frame with one or more domains of collagen or 
fibrillin, that may allow for increased crosslinking, stability, or elasticity. Such a chimera 
is especially useful as a surgical biomaterial. 

Example 7: Production of Biofilaments in Mammary Epithelial or Uroepithelial Cell Lines 

The synthesized biofilament-encoding nucleic acid molecules of the invention are 
expressed in homologous, biologically relevant cell culture systems. Genetic stability of 
the synthetic genes, secretion ability, and attributes of the produced biofilament(s) can, 
thus, be evaluated very efficiently before transgenic animal studies are initiated. 

Mammary epithelial and urine-producing cell lines are transfected according to 
standard techniques (see, e.g., Ausubel et al., supra) with their respective plasmids (e.g., a 
milk-specific promoter containing expression construct is transfected into mammary 
epithelial cells). In one example, to demonstrate the secretion of the produced spider 
silk(s), culture medium is harvested from the stable cell lines (24-96 hours post- 
differentiation for mammary epithelial cells) and resolved by SDS-PAGE for Western 
blotting analysis using anti-spider silk antibodies (commercially available from 
Monsanto, St. Louis, MO), or antibodies raised against peptides having sequences 
homologous to SEQ ID NOs: 1-4. 

Example 8: Generation of Transgenic Animals Expressing Biofilament(s) in Milk or Urine 

In some methods of transgenesis, transgenes are introduced into the pronuclei of 
fertilized oocytes. For some animals, such as mice, fertilization is performed in vivo and 
fertilized ova are surgically removed. In other animals, the ova can be removed from 
live, or from newly-dead (e.g., slaughterhouse) animals and fertilized in vitro. 
Transgenes are usually introduced by microinjection (Ogata et al., U.S.P.N. 4,873,292). 
The microinjected zygotes are transferred to an appropriate female resulting in the birth 
of a transgenic or chimeric animal, depending upon the stage of development when the 
transgene is integrated. Chimeric animals can be bred to form true germline transgenic 
animals. 

Alternatively, transgenes can be introduced into embryonic stem cells (ES cells). 
Transgenes can be introduced into such cells by electroporation, microinjection, or any 
other techniques used for the transfection of cells which are known to the skilled artisan. 
Transformed cells are combined with blastocysts from the animal from which they 
originate. The cells colonize the embryo, and in some embryos these cells form the 
germline of the resulting chimeric animal (Jaenisch, R., Science 240: 1468-1474, 1988). 
Alternatively, ES cells can be used as a source of nuclei for transplantation into 
enucleated fertilized oocytes, thus giving rise to a transgenic animal. 

Production of a Biofilament in Milk 

In one example, the spider silk gene is subcloned into an expression vector that 
contains a casein gene promoter, such that the expression of the spider silk gene is 
controlled by the casein gene promoter. One such expression vector is a casein 
expression vector based on the vectors described by Rosen, J. M. (U.S.P.N. 5,304,489) 
and Meade and Lonberg (U.S.P.N. 4,873,316). 

In another example, the WAP/silk gene hybrid fragment (the EcoRI fragment 
described above in Example 2) is purified and microinjected into fertilized mouse eggs, 
which are then implanted into foster mothers. The presence of the WAP/silk gene is 
identified by Southern blot analysis of tail DNA (DNA isolated from tail tissue) in 
accordance with methods well known in the art. Positive founder animals are then 
back-crossed to generate hemizygous animals that are used for studying transgene expression. 

Production of a Biofilament in Urine 

The spider silk gene, for example, may be subcloned into an expression vector 
such that the spider silk gene is under the transcriptional control of the uroplakin 
promoter (described by Lin et al., Proc. Natl. Acad. Sci. U.S.A. 92: 679-683, 1995). 

Urine has the added advantage that its pH and salt composition can be modulated 
to allow for the incorporation of certain components that may favor the formation of 
the silk in a structure close to its native one. 

Example 9: Generation of "Double Transgenic" Animals that Produce Biofilaments in Both Milk and Urine 

A transgenic animal may be generated that produces a biofilament protein (e.g., 
spider silk) in both its urine and its milk. One method for constructing such an animal is 
to transform an embryonal cell of the animal with two constructs, one construct in which 
the expression of the biofilament-encoding nucleic acid molecule is directed by a 
promoter capable of expressing and secreting the biofilament from a milk-producing cell; 
and a second construct in which the expression of the biofilament-encoding nucleic acid 
molecule is directed by a promoter capable of expressing and secreting the biofilament 
from a urine-producing cell. In this method, the doubly-transformed cell is used to 
generate a transgenic animal. 

Hence, mammalian (e.g., ruminant) zygotes are microinjected (or co- 
microinjected) with two nucleic acid fragments: one that expresses the biofilament 
protein under the control of a milk promoter, and one that expresses the biofilament 
protein under the control of a urine-specific promoter. The resulting transgenic animal 
will secrete the biofilament in both its milk and its urine. This will 
increase the total output of biofilaments produced per transgenic animal. 

A second method for producing such an animal capable of producing a biofilament 
in both its milk and its urine is to separately generate an embryonic stem cell carrying a 
construct capable of expressing and secreting the biofilament in a milk-producing cell and 
an embryonic stem cell carrying a construct capable of expressing and secreting the 
biofilament in a urine-producing cell. Both transformed ES cell types are then combined 
with blastocysts from the animal from which they originate to produce chimeric animals, 
which may then be bred to homozygosity. 

This type of double-expressing animal has a number of advantages. First, animals 
of both genders will produce biofilaments in the urine on a continual basis from birth, and 
female animals will then be able to produce additional biofilament protein in their milk as 
lactating adults. Second, the amount of biofilament produced by any individual 
female animal may be increased (by inducing lactation) or reduced (by not inducing 
lactation) as the need for biofilament changes. 

Example 10: Generation of Transgenic Animals that Coexpress and Assemble a Biofilament Protein from Two Monomers 

A transgenic animal may be generated that produces two proteins which are both 
required for the correct assembly of a biofilament protein in its milk, urine, or both. For 
example, Araneus diadematus dragline silk consists of a ratio of ADF3 and ADF4. One 
method for constructing such an animal is to transform an embryonal cell of the animal 
with two constructs, each expressing one of the two proteins. These proteins may be 
expressed in the urine or milk, or both. 

In a second method, the construct described in Example 5 which encodes a 
polycistronic message may be employed. This construct may be able to secrete its 
products in the milk, urine, or both. Once the two proteins are present in the same fluid 
(i.e., both in the milk or both in the urine), they can assemble correctly and be purified. 

What is claimed is: 
Claims 

1. A nucleic acid molecule comprising (i) a nucleic acid sequence encoding a 
biofilament, (ii) a promoter that directs expression of a polypeptide in milk-producing 
cells or urine-producing cells, said promoter operably linked to said sequence, and (iii) a 
leader sequence that enables secretion of said biofilament by said milk-producing cells or 
said urine-producing cells, into milk or urine, respectively, of a mammal. 

2. A mammalian embryo whose nucleus comprises the nucleic acid molecule of 
claim 1. 

3. A female mammal in which the genome of the mammary tissue of said female 
mammal comprises the nucleic acid molecule of claim 1, wherein said promoter is milk- 
producing cell-specific. 

4. The mammal of claim 3, wherein said mammal is selected from the group 
consisting of a rodent, a ruminant, and a goat. 

5. An animal in which the genome of cells that contribute to urine production in 
said animal comprises the nucleic acid molecule of claim 1, wherein said promoter is 
urine-producing cell-specific. 

6. The animal of claim 5, wherein said animal is a mammal. 

7. The nucleic acid molecule of claim 1, wherein said biofilament is spider silk. 

8. The nucleic acid molecule of claim 7, wherein said spider silk is dragline silk. 

9. The nucleic acid molecule of claim 1, wherein said biofilament, when secreted 
such that the secretion is subjected to shear forces and mechanical extension, has a 
poly-alanine segment that undergoes a helix to a β-sheet transition, said transition forming a 
β-crystal that stabilizes the structure of said biofilament. 

10. The nucleic acid molecule of claim 1, wherein said biofilament has an 
amorphous domain that forms a β-pleated sheet such that inter-β sheet spacings are 
between 3 angstroms and 8 angstroms in size. 

11. The nucleic acid molecule of claim 1, wherein said biofilament has a 
C-terminal amino acid motif comprising an amorphous domain and a crystal forming 
domain, said motif having a sequence that is at least 50% identical to SEQ ID NO: 2. 

12. The nucleic acid molecule of claim 1, wherein said biofilament has a 
consensus sequence that is at least 50% identical to SEQ ID NO: 3. 



13. A method for producing a biofilament, said method comprising the steps of: 

(a) providing an embryonal cell transfected with a biofilament encoding nucleic 
acid molecule that expresses and causes secretion of said biofilament from a cell derived 
from said transfected embryonal cell; 

(b) growing said embryonal cell to produce an animal comprising biofilament 
expressing and secreting cells; and 

(c) isolating said biofilament from said biofilament expressing and secreting cells 
from said animal. 

14. A method for producing a biofilament, said method comprising the steps of: 

(a) providing an animal cell transfected with a nucleic acid molecule that contains 
(i) a nucleic acid sequence encoding a biofilament, (ii) a promoter that directs expression 
of a polypeptide in an animal cell, and (iii) a leader sequence that causes secretion of said 
biofilament by said cell; 

(b) culturing said transfected cell; and 

(c) isolating said biofilament from the culture medium of said cultured transfected 
cell. 

15. The method of claim 13 or 14, wherein said biofilament is spider silk. 

16. The method of claim 15, wherein said spider silk is dragline silk. 

17. The method of claim 13 or 14, wherein said biofilament, when secreted such 
that the secretion is subjected to shear forces and mechanical extension, has a 
poly-alanine segment that undergoes a helix to a β-sheet transition, said transition forming a 
β-crystal that stabilizes the structure of said biofilament. 

18. The method of claim 13 or 14, wherein said biofilament has an amorphous 
domain that forms a β-pleated sheet such that inter-β sheet spacings are between 3 
angstroms and 8 angstroms in size. 

19. The method of claim 13 or 14, wherein said biofilament has a C-terminal 
amino acid motif comprising an amorphous domain and a crystal forming domain, said 
motif having a sequence that is at least 50% identical to SEQ ID NO: 2. 

20. The method of claim 13 or 14, wherein said biofilament has a consensus 
sequence that is at least 50% identical to SEQ ID NO: 3. 

21. The method of claim 13 or 14, wherein said animal is a mammal. 

PRODUCTION OF BIOFILAMENTS IN TRANSGENIC ANIMALS 



Abstract of the Disclosure 
Disclosed is a method for the recombinant production of biofilaments, such as 
spider silk or insect fibroins, using transgenic animals which secrete the biofilaments in 
their milk and/or urine, and transgenic cells which secrete the biofilaments into culture 
media. Such a method is useful for producing large quantities of biofilament material. 
Also disclosed is a nucleic acid molecule for generating such transgenic animals. 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and 
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled PRODUCTION OF BIOFILAMENTS IN TRANSGENIC 
ANIMALS, the specification of which 

■ is attached hereto. 

□ was filed on ________ as Application Serial No. ________ and was amended on ________. 

□ was described and claimed in PCT International Application No. ________ 
filed on ________ and as amended under PCT Article 19 on ________. 

I hereby state that I have reviewed and understand the contents of the above-identified specification, 
including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information I know to be material to patentability in accordance with 
Title 37, Code of Federal Regulations, §1.56(a). 

I hereby claim foreign priority benefits under Title 35, United States Code, §119 of any foreign 
application(s) for patent or inventor's certificate or of any PCT international application(s) designating at 
least one country other than the United States of America listed below and have also identified below any 
foreign application for patent or inventor's certificate or any PCT international application(s) designating at 
least one country other than the United States of America filed by me on the same subject matter having a 
filing date before that of the application(s) of which priority is claimed: 

Country          Serial Number          Filing Date          Priority Claimed? 
                                                             Yes/No 
                                                             Yes/No 
                                                             Yes/No 

I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) 
listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in 
the prior United States application in the manner provided by the first paragraph of Title 35, United States 
Code, §112, I acknowledge the duty to disclose all information I know to be material to patentability as 
defined in Title 37, Code of Federal Regulations, §1.56(a) which became available between the filing date 
of the prior application and the national or PCT international filing date of this application: 

I hereby appoint the following attorneys and/or agents to prosecute this application and to transact all 
business in the Patent and Trademark Office connected therewith: Paul T. Clark, Reg. No. 30,162, Karen 
L. Elbing, Reg. No. 35,238, Kristina Bieker-Brady, Reg. No. 39,109, and Evelyn D. Shen, Reg. No. 39,834. 



Address all telephone calls to: Paul T. Clark at 617/428-0200. 

Address all correspondence to: Paul T. Clark, Esq. at Clark & Elbing LLP, 176 Federal Street, Boston, MA 
02110. 

I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patents issued thereon. 
(First, Middle, Last): Costas N. Karatzas

Residence Address (City, State, Country): Quebec, Canada

Post Office Address (Street, City, State, Country): 251 Sherwood, Beaconsfield, Quebec, Canada H9W 2H4

Citizenship: Greek


Signature: 


Date: 
(First, Middle, Last): Jeffrey D. Turner

Residence Address (City, State, Country): Quebec, Canada

Post Office Address (Street, City, State, Country): 1 Riley Road, Hudson, Quebec, Canada J0P 1H0

Citizenship: Canadian


Signature: 


Date: 
Karatzas 
251 Sherwood, 
Canadian 
Applicant or Patentee: Costas N. Karatzas et al. 
Serial or Patent No.: 
Filed or Issued: March 17, 1998 
Title: PRODUCTION OF BIOFILAMENTS IN TRANSGENIC ANIMALS 



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(f) and 1.27(c)) - SMALL BUSINESS CONCERN 



I hereby declare that I am 

[ ] the owner of the small business concern identified below: 

[X] an official of the small business concern empowered to act on behalf of the concern identified below: 



Name of Small Business Concern: NEXIA BIOTECHNOLOGIES, INC. 

Address of Small Business Concern: 21,111 LAKESHORE ROAD, STE. ANNE DE BELLEVUE, QUEBEC, CANADA 

I hereby declare that the above identified small business concern qualifies as a small business concern as defined in 13 CFR 
121.12 and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees to the United States Patent and Trademark Office, 
in that the number of employees of the concern, including those of its affiliates, does not exceed 500 persons. For purposes of this 
statement, (1) the number of employees of the business concern is the average over the previous fiscal year of the concern of the 
persons employed on a full-time, part-time or temporary basis during each of the pay periods of the fiscal year, and (2) concerns 
are affiliates of each other when either, directly or indirectly, one concern controls or has the power to control the other, or a third 
party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the small business concern identified 
above with regard to the invention, entitled PRODUCTION OF BIOFILAMENTS IN TRANSGENIC ANIMALS 
by inventors Costas N. Karatzas, Jeffrey D. Turner, Anthoula Lazaris-Karatzas described in 

[X] the specification filed herewith. 

[ ] application serial no. , filed . 

[ ] patent no. [**PATENT NUMBER**], issued [**ISSUE DATE**]. 

If the rights held by the above identified small business concern are not exclusive, each individual, concern or organization having 
rights to the invention is listed below* and no rights to the invention are held by any person, other than the inventor, who would not 
qualify as an independent inventor under 37 CFR 1.9(c) if that person made the invention, or by any concern which would not 
qualify as a small business concern under 37 CFR 1.9(d), or a nonprofit organization under 37 CFR 1.9(e). *NOTE: Separate 
verified statements are required from each named person, concern or organization having rights to the invention averring to their 
status as small entities. (37 CFR 1.27) 

Assignee Name: 

Assignee Address: 

[ ] INDIVIDUAL [ ] SMALL BUSINESS CONCERN [ ] NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of entitlement to 
small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee due after the date on 
which status as a small entity is no longer appropriate. (37 CFR 1.28(b)) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States Code, and that 
such willful false statements may jeopardize the validity of the application, any patent issuing thereon, or any patent on which this 
verified statement is directed. 

Name: Jeffrey D. Turner, Ph.D. 

Title: President and Chief Executive Officer 



Address: Nexia Biotechnologies, Inc., 21,111 Lakeshore Road, Ste. Anne De Bellevue, Quebec, Canada 



